Soft errors handling in EEPROM devices

ABSTRACT

Soft errors occur during normal use of a solid-state memory such as EEPROM or Flash EEPROM. A soft error results from the programmed threshold voltage of a memory cell being drifted from its originally intended level. The error is initially not readily detected during normal read until the cumulative drift becomes so severe that it develops into a hard error. Data could be lost if enough of these hard errors swamps available error correction codes in the memory. A memory device and techniques therefor are capable of detecting these drifts and substantially maintaining the threshold voltage of each memory cell to its intended level throughout the use of the memory device, thereby resisting the development of soft errors into hard errors.

This is a division of application Ser. No. 07/886,030 filed May 20,1992.

TECHNICAL FIELD

This invention relates generally to error handling in semiconductormemories. In one specific aspect, this invention relates to improvingthe reliability of memory devices. In another specific aspect, thisinvention is related to continual detection and correction of potentialdata errors in memories such EEPROM or Flash EEPROM.

BACKGROUND OF THE INVENTION

While mass data backup storage in data processing systems has so farrelied on magnetic disk drives, the relatively high failure rate ofthese devices, their fragility, bulkiness and high power consumption(all the results of the devices' heavy dependence upon high precisionmoving mechanical parts) have led the industry to seek replacementstherefor.

One of the heavily pursued candidates is semiconductor memory. Whileseveral types of semiconductor memories exist, not all of them canfeasibly be used for mass storage of data. For example, random accessmemory (RAM), being a volatile memory requiring constant supply ofelectrical power to maintain its memory, is more suitable to be used astemporary working storage and not for mass data backup. And while readonly memory (ROM), programmable read only memory. (PROM) andultra-violet programmable read only memory (UVPROM) are non-volatile,the impossibility, or difficulties, in altering their contents haverendered these memories unsuitable for backup mass data storage.

Recently, an interest has been developed in using electrically erasableprogrammable read only memory (EEPROM) and Flash EEPROM for mass datastorage.

EEPROM and Flash EEPROM, and the implementation thereof to serve as massstorage and replace magnetic disk drives, are disclosed in pending U.S.patent application, Ser. No. 337,566 of Harari et al., filed Apr. 13,1989, and co-pending U.S. patent application, Ser. No. 422,949 of Grosset al., filed. Oct. 17, 1989, both of which have the same assignee asthe present application.

Essentially, EEPROM or Flash EEPROM are field effect transistors eachwith an additional polysilicon region generally referred to as thefloating gate. Data is "memorized" through confinement of predefinedamounts of electric charge in this floating gate.

The electric charge are transferred to the floating gate from thesubstrate through a dielectric region. They affect the conductivity ofthe source-drain channel and the threshold voltage of the field effecttransistor. Physically, the differences in threshold voltages and thedifferences in the source-drain currents, due to the confinement ofdifferent amounts of electric charge in the floating gates, can then beused to define different logic states (e g. "0", "1", . . . ) .Demarcation threshold voltage levels may be used to demarcate betweenthe different logic states. For example, a "0" or "1" state wouldrespectively have a programmed threshold voltage level less than orgreater than the demarcation threshold voltage level between these twostates.

Thus each memory cell is capable of supporting a range of thresholdvoltage levels within a "threshold window" spanned by a maximum andminimum programmable threshold voltage level. Schematically, thethreshold window may be partitioned into threshold voltage domains, eachrepresenting a memory state. Each domain may be defined by a pair ofdemarcation threshold voltage levels. In practice, a given memory stateis represented by programming a threshold voltage level well within itscorresponding domain, preferably located in the middle, with equalmargins on either side of the pair of demarcation levels

Traditionally, EEPROM and Flash EEPROM are used in applications wheresemi-permanent storage of data or program is required but with limitedreprogramming. But as EEPROM and Flash EEPROM are now intended toreplace magnetic disks, a new requirement surfaces--the requirement tomaintain reliability and availability with increased program/erasecycles.

As with most devices, EEPROM and Flash EEPROM are susceptible to defectsand failures. One consequence is the occurrence of soft errors caused bythe gradual shifting of the threshold level of the memory states. Theshifting of the threshold level is partly due to ambient conditions andmostly due to stress from normal operations of the memory device such aserase, program or read. As discussed earlier, a cell's threshold levelis typically programmed with a margin from the demarcation levels. Whenthe threshold level is shifted from its programmed level, thereliability of reading the intended memory state may be compromised.These soft errors, in their initial stages, are not severe enough to bereadily detected during normal operations of the memory device. However,if the shifting is allowed to continue beyond the error margin allowedby the normal read circuit, the soft errors will eventually develop intohard errors and produce read errors. In that event, the hard errors maybe corrected by some sort of error correction scheme such as an errorcorrection code (ECC), and the cells in question may be mapped out.However, if too many hard errors had developed at the same time, theerrors may be uncorrectable because they overload the capacity of theerror correction scheme used. Thus, the capacity of the memory devicemay gradually be decimated, and worse still, possible uncorrectableerrors can render the memory device unreliable.

The above identified failures are unfamiliar to engineers and scientistsworking on other semiconductor memories. For example, whereas DRAM mayalso suffer from failures due to charge leakage, such leakage ispredominantly the result of bombardment by alpha particles. Thus, DRAMfailures are instantaneous, unpredictable, random and independent of theprogram/erase circles. On the other hand, EEPROM failures are generallygradual, predictable and depend upon the number of times a memory iserased and programmed. Moreover, whereas DRAM failures are isolated,EEPROM failures are aggregative, as a group of cells may be subjected torepeated memory operations. When failures occur in aggregate, they mayoverload known error correction schemes.

The different characteristics of the failures between DRAM and EEPROMthus demand prevention techniques that are completely different fromthose available to DRAM designers.

Co-pending U.S. patent application, Ser. No. 337,566 of Harari et al.discloses a scheme of detecting and handling errors "on-the-fly" byverifying each memory operation, such as read, program or erase, afterits has been performed. A verification failure indicates a defectivecell, and the address of a defective cell is re-mapped the address of agood one. Similarly co-pending U.S. patent application, Ser. No. 422,949of Gross et al. discloses a scheme of detecting and handling errors"on-the-fly."

These error detection schemes are based on detecting errorscell-by-cell, by sensing abnormal read, program or erase operatingconditions of the memory devices. However, they are not effective inpreventing soft errors which may later deteriorate into catastrophicerrors. For example, a small voltage drop caused by a soft error mayescape detection by the schemes disclosed in these co-pendingapplications because it is still within the margin of error allowed bythe read, program or erase operations. But a group of cells may continueto deteriorate until they cross the margin of errors together. At thatpoint, remedies may be unavailable.

What is needed is an error correction scheme that is capable ofpreventing and correcting mass data deteriorations.

Accordingly, it is an object of the invention to provide a Flash EEPROMor EEPROM device with improved reliability.

It is another object of the invention to provide a Flash EEPROM orEEPROM device capable of detecting and correcting potential errorsduring the lifetime of the device.

It is yet another object Of the invention to provide a Flash EEPROM orEEPROM device capable of recovering data from read errors which may notbe correctable by available error correction schemes.

SUMMARY OF THE INVENTION

These and additional objects are accomplished by improvements insolid-state memory systems such as EEPROM and Flash EEPROM systems andin techniques that allow simple and effective handling of errors anddefects, so that non-volatile, solid-state memory, even if imperfect,may be used reliably and economically as a mass storage.

In an EEPROM cell capable of storing two or more memory states by virtueof its programmed threshold voltage, any two adjacent memory states suchas "0" and "1" may be distinguished by whether or not the programmedthreshold voltage level is below or above a corresponding pair ofdemarcation voltage levels. One important aspect of the invention is thecontinual monitoring and correction of the memory cells' thresholdvoltage levels, in recognition of their tendency to shift and possiblyconverge towards the demarcation voltage levels during normal operationsof the memory. In other words, the nature of the EEPROM or Flash EEPROMdevices is such that the threshold window spanned by the minimum andmaximum threshold voltage levels tends to close with use, and theinvention provides a dynamic scheme for resisting this trend byresetting the shifted threshold levels to their original intendedlevels.

Under normal operation, a two-state memory cell, for example, is erasedwith sufficient margin below a demarcation threshold voltage level to ahard "0" or programmed with sufficient margin above the demarcationvoltage level to a hard "1". As the memory device is used, the thresholdlevel of a cell not subjected to erase or program operations may losemargin, thereby producing a soft error not readily detectable by normaloperations of the device. The invention provides a scheme forcontinually "scrubbing" the sectors in the array to maintain all cellswithin the proper margins. When a sector is scrubbed, its cells aretested to determine if their margins are maintained, and if not, theyare rewritten with the proper margins.

An important mechanism for occurrence of soft errors is due to normaloperations in one area of the memory device creating perturbations inother areas. The perturbation may cause electric charge to leak eitherinto or out of the floating gate of the perturbed cells, again resultingin a shifting of the programmed threshold voltage levels. If enough ofthese perturbations accumulate on a cell, a hard error could develop.

In one embodiment, the scrubbing scheme is applied to a memory arraythat is organized into a plurality of independently erasable andprogrammable. sectors of memory cells, wherein operations such asprogramming on one or a group of sectors may disturb or shift thethreshold levels in cells of other sectors in the array. The inventioncalls for the "scrubbing" of at least one of the other sectors everytime after it has been subjected to a predefined number of potentiallydisturbing operations. In this way statistically, no sector in the arraywill likely be exposed to such potential disturbances more than apredetermined number of times, even if it never had occasion to beerased or programmed. This is because it will eventually be visited bythe scrubbing treatment which checks the cells of the sector for propermargining and rejuvenate them by re-writing the data with a propermargin if required.

When the disturbing operation is a write operation, the scrubbingoperation is applied to a memory array after a predefined number ofwrite operations performed on a portion of that array. Preferably one ormore sectors other than ones that have been just written are scrubbed.

When the disturbing operation is a read operation, the scrubbingoperation is applied to a memory array after a predefined number of readoperations performed on that array. Either the sector that has been reador another randomly chosen sector is scrubbed.

The "scrubbing" technique as prescribed by the invention is a verypowerful, yet simple way to improve the reliability and extend thelife-expectancy of the EEPROM or Flash EEPROM devices. By "amortizing"the scrubbing operation of the whole array over many normal operations,the memory device suffers very little degradation in performance.

According to another aspect of the invention, a read margining techniqueis used to recover data in a set of cells where the number of harderrors exceeds the capacity of the available error correction means.This applies to the case where the threshold levels of a number of cellsin the set have shifted too close to or beyond the demarcation thresholdvoltage level that the original states can no longer be read correctly.These hard errors become unrecoverable when their number exceeds thatwhich could be corrected by the error correction code (ECC) implementedfor the set. The read margining technique calls for a trial and errorprocess of reading the set of cells in question each time with arepositioned demarcation threshold voltage level displaced about thenormal level in both directions until the number of hard errors isreduced sufficiently for ECC to effect recovery of data.

Additional objects, features and advantages of the present inventionwill be understood from the following description of the preferredembodiments, which description should be taken in conjunction with theaccompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1a is a schematic representation of a single EEPROM cell.

FIG. 1b is a cross-sectional view of an EEPROM device integrated circuitstructure.

FIG. 2a illustrates a two-dimensional array of EEPROM cells.

FIG. 2b illustrates the partitioning of an array of memory cells into aplurality of flash sectors in the preferred embodiment.

FIG. 3 lists exemplary voltage values for normal operations of thememory devices according to the preferred embodiment.

FIG. 4 is a schematic diagram illustrating the functional blocks of thememory device according to the preferred embodiment.

FIG. 5 is a schematic diagram illustrating the functional blocks of amemory controller for controlling a memory device.

FIG. 6a is a schematic diagram illustrating one implementation of a readcircuit for a memory cell.

FIG. 6b is a schematic diagram illustrating another implementation of aread circuit for a memory cell.

FIG. 7 is a flow chart illustrating generally the steps of an eraseoperation according to the preferred embodiment.

FIG. 8 is a flow chart illustrating generally the steps of a programmingoperation according to the preferred embodiment.

FIG. 9 is a flow chart illustrating generally the steps of a scruboperation according to the preferred embodiment.

FIG. 10 depicts exemplary relative margining voltage levels forperforming the different operations according to the preferredembodiment.

FIG. 11 lists exemplary values of margining voltage levels forperforming the different operations according to the preferredembodiment.

DETAILED DESCRIPTION OF THE EMBODIMENT

FIG. 1a illustrates schematically a typical EEPROM cell 10 having afloating gate 12, a source 14, a drain 16, a control gate 18 and anerase gate 20. An exemplary EEPROM structure is generally illustrated inthe cross-sectional view of FIG. 1b. The memory cell 10 is formed on alightly p-doped substrate 30. The source 14 and the drain 16 are formedas two heavily n-doped implanted regions. The floating gate 12, and thecontrol gate 18 are generally made of polysilicon material and insulatedfrom each other and from other conductive elements by regions ofdielectric material 19.

The memory cell 10 is programmed by transferring electrons from thesubstrate 30 to the floating gate 12. In the example cell of FIG. 1b,the electric charge in the floating gate 12 is increased by electronsforced across the dielectric region 19 from the channel 32 near thedrain 16 and into the floating gate 12. Electric charge is removed fromthe floating gate 12 during an erase operation through the dielectricregion 19 between it and the erase gate 20. A preferred EEPROMstructure, and the process for its manufacture, are described in detailin co-pending U.S. patent application Ser. No. 323,779 of Jack H. Yuanand Eliyahou Harari, filed Mar. 15, 1989, which is incorporated hereinby reference.

The amount of electric charge stored in the floating gate 12 can bedetected by sensing the change of the threshold voltage, which increaseswith the amount of electric charge on the floating gate 12. Depending onthe characteristics of the memory cells and system, a range of thresholdvoltage levels is possible, defining a threshold window spanned by amaximum and a minimum level. Thus, it is possible to partition thethreshold window into two or more portions with demarcation thresholdvoltage levels and to define two or more memory states. Multi-stateEEPROM devices have been disclosed in co-pending U.S. patent applicationSer. No. 508,273 of Mehrotra et al., filed Apr. 11, 1990, which isincorporated herein by reference.

For the purpose of discussion and illustration, reference willhereinafter be made to a two-state memory cell. However, it should beunderstood that generalization to more than two states are equallyapplicable and contemplated.

As an example, a memory cell may have a threshold window ranging from 1Vto 9V. A demarcation threshold voltage level of 5V may be used todemarcate between "0" and "1" states in a two-state memory cell.Generally, a good "0" state may have been erased to a threshold voltageof 3V with adequate margin to spare. Similarly a good "1" state may havebeen programmed to a threshold voltage of at least 6V.

The detection can be performed by measuring the source-drain currentI_(DS) between the source 14 and the drain 16 in the presence of areference or predefined voltage applied to the control gate 18. If apotential difference is applied between the source and the drain 16 whena 5V is applied to the control gate 18, the drain-source current I_(DS)will be higher when the cell 10 is in the "erased" state than when it isin the "programmed" state.

The various aspects of the present invention are typically applied to anarray of flash EEPROM cells in an integrated circuit chip.

FIG. 2a illustrates schematically a two dimensional array of EEPROMcells 201. Each cell 10 has a structure similar to the one shown in FIG.1a or 1b, with a control gate, a source, a drain and an erase gate. Thearray of individual memory cells 201 are organized in rows and columns.The control gates of the cells in each row are connected by a word line50, such as W₁, W2. The cells along each column have their sources anddrains individually connected to a pair of bit lines, such as B₀, B₁ forcolumn 61 and B₁, B₂ for column 62. Each cell is addressable byenergizing the word line and the pairs of bit lines to which it isconnected.

FIG. 2a illustrates a preferred architecture in which the drain of onecell such as 61 is connected to the source of the adjacent cell. Thus,the column 61 has the bit line B₀ as the source line (SL) and the bitline B₁ as the drain line (DL). Similarly, for the column 62, the bitline B₁ is the source line and the bit line B₂ is the drain line.

An erase line, such as E₁, E₂, is connected to the erase gate of eachcell in a row. In flash EEPROM devices, the erase gates of a sector (tobe described hereinafter) of several rows of cells are typicallyconnected in common so they can be erased together "in a flash".

As a specific example, a row of cells provides 16×8 bytes, or 128 bytesof data. Extra cells are redundantly provided for re-mapping in casecertain cells become defective.

FIG. 2b illustrates the partitioning of an array of memory cells into acolumn of flash sectors in the preferred embodiment. In conformity withdisk operating systems in which data is programmed or erased in 512 bytesector at a time, each flash sector such as 80, 82, . . . is formed by 4rows of cells, forming 512 bytes of data. For example, the erase gates,such as E1, E2, E3 and E4, of a sector of cells, such as 80, are tiedtogether so that they are erased in a single erase operation as in atypical magnetic disk drive. Typically programming operations are alsoapplied to a sector of cells at a time.

FIG. 3 provides exemplary voltage conditions of a Flash EEPROM systemfor normal memory operations. In performing a read, program or eraseoperation, each cell is addressed by selectively applying predefinedvoltages to its word line, source line and drain line. For, example, inperforming a read operation, 5 volt is applied to the word line, alongwith 0 volt on the source line (SL) and 1.5 volt on the drain line (DL).The bit lines of cells not involved in the read operation are floated.To program a "1" into a cell, 12 volt is applied to its word line, WK,and 0 volt is applied to its source line (SL) and 8 volt is applied toit drain line (DL). The bit lines of cells not involved in the programoperation are floated. In performing an erase operation, 20 volt isapplied to its erase line.

Program Disturb

In a memory array organized into a column of flash erasable sectors ofmemory cells, a problem which may be termed "program disturb" can be themechanism for creating soft errors. The memory array is a twodimensional matrix of memory cells, where each flash sector is formed byone or more rows of cells and the source and drain of each cell down acolumn is respectively interconnected by a bit line. Thus, the same setof bit lines run through all the column of sectors. To program a cell, avoltage must be applied across its drain and source through the bitlines. At the same time, the cell must also be activated by a voltage toits control gate. For example, the column containing the cell is enabledby having its drain line raised to a high voltage of 8 volt relative toits drain line. The cell to be programmed is addressed by applying 12volt pulses to its word line. Other cells within the same column are notaddressed since their word line is at zero potential; nevertheless theymay be affected by the program operation of the addressed cell becauseof the high voltage on the common drain line. This may induce electriccharge leakage in these non-addressed cells, resulting in either a gainor loss of electric charge in their floating gates depending on theelectrical mechanism.

In a memory array consisting of a column of independently erasable andprogrammable sectors, the effect of "program disturb" is to create softerrors by shifting the threshold voltage levels of the cells in othersectors not being programmed. This effect is cumulative, and willeventually result in the soft errors developing into hard errors. It hasbeen estimated that after about 10⁶ to 10₇ such disturbances, a memorycell may develop a hard error. In a sector, the hard errors that developand which cause data to be misread may initially be corrected by ECC,but as more of them developed, they may become uncorrectable as thecapacity of ECC is exceeded.

In the case of conventional Flash EEPROM, the whole array is flasherased and programmed every cycle. Any area of the array is at mostdisturbed once before it is re-programmed and consequently "programdisturb" is not a major concern.

FIG. 4 is a block diagram of a memory device 141 in which the variousaspects of the present invention are implemented. The memory device 141is essentially similar to that of the co-pending Mehrotra et al., U.S.patent application Ser. No. 508,273 which was earlier incorporatedherein by reference.

The memory device 141 shown in FIG. 4 comprises the flash EEPROM arraydevice 201 as illustrated in FIG. 2. The memory device 141 receivescommands (e.g. read, program, erase) from a memory controller 133, alongwith the corresponding addresses and data (if any) through an interface135.

The commands, addresses and data are received by a protocol logic 205with the memory device 141. The protocol logic 205 decodes a commandinto control signals 345, including a program signal, a erase signal (E)and a read signal (R). In accordance with the present invention, a"program verify (PV)" signal, a "program read high (PRH)" signal, aprogram read low (PRL) signal, a "scrub read high (SRH)" signal and ascrub read low (SRL) are generated along with the program signal fromthe program command. Also in accordance with the present invention an"erase verify (EV)" signal is generated along with a erase signal froman erase command.

In one embodiment, a plurality of voltage sources 245 are received bythe memory device 141 from the memory controller. The voltage sourcesare sent to the power control unit 207. The power control unit 207 alsoreceives the control signals identified in the previous paragraph fromthe protocol logic 205.

In response to the control signals from the protocol logic 205,appropriate ones 346 of these voltages is output from the power controlunit 207 for use in a specific memory operations. These voltagesinclude: read voltage (V_(R)) which is enabled by the read controlSignal (R), a program verify voltage (V_(PV)) which is enabled by theprogram verify (PV) signal, a program read high Voltage (V_(PRH)) whichis enabled by the program read high (PRH) signal, a program read lowvoltage (V_(PRL)) which is enable by the program read (PRL) signal, ascrub read high voltage (V_(SRH)) which is enabled by the scrub readhigh (SRH) signal, a scrub read low voltage (V_(SRL)) which is enabledby the scrub read low (SRL) signal and an erase verify voltage (V_(EV))which is enabled by the erase verify (EV).

Data received from the memory Controller 133 for a program operation aregated into a write circuit 211. A buffer is provided within the writecircuit 211 for storing the incoming data so that it can be used duringthe program operation, as well as for verifying that the data havesuccessfully been programmed into the memory array 201, as will bedescribed hereinafter.

The data stored in the memory array 201 are retrieved by a read circuit213 during a read operation or during one of the verify operationsperformed after a program operation, as will be described hereinafter.Data read from the memory array 201 are stored in a shift register 219so that they can be sent out to the memory controller 133.

A compare circuit 217 is provided in the memory device 141. This comparecircuit 217 can be used, during a verify operation, for comparing thedata sent from the memory controller 133 to be programmed into thememory 201 (which has been stored in the buffer of the write circuit211) with the data that have actually been stored into the memory array201 for the program operation.

An erase circuit 215 is provided in the memory device 141. This erasecircuit 215 receives a erase signal from the protocol logic 205 andgenerates the erase voltage to the erase lines of the addressed array ofcells in the memory array 141.

Addresses received from the memory controller 133 are decoded into wordaddress signals which are gated into a sector/row tag latch 223 andcolumn address signals which are gated into a column decode 224. Duringoperation of the memory device 141, the sector/row tag latch 223 and thecolumn decode 224 operate to connect the voltages coming out of thepower control 207 to the proper word lines and bit lines of the memoryarray 201.

For example, during a read or program operation, the control gatevoltage V_(CG) is connected to the word line of the addressed cell. Atthe same time, source voltage V_(S) and drain voltage V_(D) areindividually connected to the source line and drain line of theaddressed cell(s). Similarly, in erase operations, the erase voltageV_(EG), is connected to the erase lines of the addressed cell(s).

In read operations and in verify operations following a programoperation, the memory state of an addressed cell is read by sensing theconduction current I_(DS) through its source and drain when a predefinedV_(CG) is applied to the control gate. The drain current is sensed bysense amplifier in the read circuit 213.

FIG. 5 is a schematic illustration of the functional blocks of thememory controller 133. These functional blocks include a peripheralinterface 415 providing interface to a host computer system (not shown)and a buffer memory 413 for temporarily storing data to or from the hostcomputer system. An ECC logic 416 is provided for generating errorcorrection codes during a program operation, which will be stored alongwith the data in the memory device 141 shown in FIG. 4. The ECC logic416 also operates to check the correctness of the data using these codesduring a read operation.

The functions in the memory controller 133 may be implemented either byhardware or by software stored in a memory within a memory control logic401, to be executed by the logic 401 in appropriate times. In additionto these functions, the controller 133 also has means 405 for providingvarious voltages required for the operations of the memory device 141.

In one embodiment, all voltages needed by the memory devices 141 for thedifferent operations are generated by the memory controller 133. Inanother embodiment, only a subset of these voltages are sent to thememory device 141, and other voltages needed for particular operationsare generated by the memory device internally, using the voltagereceived from the memory controller 133.

During a read operation, incoming data from the memory device 141 arestored in the buffer memory 413 within the controller 133. While thedata are being received, the ECC logic 416 is activated. The ECC logic416 operates to calculate an error correction code for the datareceived. When the data is completely received into the buffer 413, theECC logic 416 will compare the ECC stored along with the retrieved dataagainst the calculated ECC. If an ECC error is detected, appropriateactions will be taken; and the corrected data will be sent to the hostcomputer system.

FIGS. 6a and 6b illustrate schematically how the logic states of amemory cell is sensed.

FIG. 6a illustrates a memory cell 601 whose control gate is connected tothe output of a multiplexor 602 which can be considered as part of thepower controller unit 207 shown in FIG. 4. The multiplexor 602 receivesat its inputs the several voltage sources 245 and the several controlsignals 345 described hereinabove in reference to FIG. 4.

As previously disclosed, the source terminal of the memory cell 601 isconnected to a source bit line 603 and the drain terminal is connectedto a drain bit line 604. The drain bit line 604 is also connected to oneinput of a comparator 605. The other input of the comparator 605 isconnected to the output of a reference cell 606. The function of thereference cell 606 is described in the co-pending application Ser. No.508,273 of Mehrotra et al., incorporated herein by reference earlier.

The reference cell 606 generates a reference current I_(REF) to thecomparator 605 for comparing the source current from the memory cell601. In reading the memory cell 601, the drain bit line 604 will beprecharged to provide 1.5 volts across the source and drain of thememory cell 601.

Depending upon the operations to be performed, different voltages areapplied to the control gate of the memory cell 601. However, thedrain-source current I_(DS) will be higher when the cell 601 is in the"erased" state than when it is in the "programmed" state. The I_(REF) isset at a point between the two levels of I_(DS). Thus, when the cell 601is "0" or "erased", a first voltage signal will be output from thecomparator 605 because I_(DS) is greater than I_(REF). And when the cell601 is "1", a second, different voltage signal will be output from thecomparator 605 because I_(DS) is smaller than I_(REF) .

In accordance with this invention, different voltages are applied to thecontrol gate for different operations. For example, for a normal readoperation, V_(R) (which is 5 volt in the preferred embodiment), isapplied. After the memory cell 601 has been programmed, a program verifyoperation is performed to read the content of the memory cell justprogrammed. However the voltage V_(PV) for this operation is differentfrom the voltage used in a normal read operation.

Program "1" Verify margining

For example, the voltage used in the program verify operation may behigher than the voltage used during a normal read operation. As thenegative electric charge in the floating gate operates to oppose thevoltage at the control gate, if sufficient negative electric charges areforced into the floating gate so that even a higher voltage can still beopposed, sufficient additional margin can therefore be considered tohave been provided in the program operation. By using a more stringentcondition in the verify operation than in the normal read operation,reliability of the memory is improved.

Erase "0" verify margining

Similar verification is performed after an erase operation. This verifyoperation is performed to ensure that a cell has indeed been erased to"0" with sufficient additional margin for improved reliability. Thevoltage used for this verifying operation is again more stringent, thistime, with a control gate voltage less than the voltage value used in anormal read operation.

Read "0" and "1" margining

To further improve the reliability of the memory system, in addition tothe erase and program verify operations, a read margining operation isperformed on the sector of cells after it has been written to verifyboth "0" and "1".

The read margining operation reads the cells to verify that all "1" andall "0" data have indeed been written into the sector of memory cells601 with sufficient margin to pass when they are read with control gatevoltages more stringent than the voltage used in a normal readoperation.

Automatic soft error detection and correction (scrubbing)

As explained earlier, the column of sectors in the memory array sharethe same bit lines and consequently programming of one sector maydisturb the other sectors within the array. To still further improve thereliability of the memory system against such disturbance, after apredetermined number of program operations on a selected set of sectors,a scrub operation will in general be performed on a different set ofsectors in the memory array. The scrub operation uses a control gatevoltage that is again different than the control gate voltage used in anormal read operation to insure sufficient extra margin, therebyproviding failure look-ahead. The scrub operation assures that theprogram operation on the selected set of sectors has not adverselyaffected other sectors of the memory. If a sector fails this scrubmargin check, the data within the sector is rewritten back to thatsector with full write margin levels, with the option of mapping failedbits.

In the preferred embodiment, the scrub operation would read one sectorof cells if less than eight sectors are programmed in a programoperation. If more than eight sectors but less than sixteen sectors areprogrammed, two sectors of cells would be read in the scrub operation.In general, for each eight sectors of cells programmed, one additionalsector would be subjected to the scrub operation.

FIG. 6b illustrates another implementation of the present invention.Instead of applying at the control gate voltages with different valuesfrom the normal read operation, the same voltage can used for thedifferent verify operations and the scrub operation. However, thereference current, I_(REF) , will be changed during the differentoperation. As shown in FIG. 6b, a adjustable current source 608 isconnected in series with the reference cell 606. The output current fromthe adjustable current source 608 is adjusted by the different controlsignals. For example, according to the preferred embodiment, inperforming a program verify operation (in which the "1"s are tested),the adjustable current source 608 would output a current which combineswith the current from the reference cell to form a current smaller thanthe reference current for a normal read operation. In this way, thecurrent produced by the cell 601 would have to have a tolerance from itsnormal value. As another example, in performing an erase verifyingoperation (in which the "0" are tested), the adjustable current source608 would output a current which combines with the current from thereference cell to form a current larger than the reference current for anormal read operation.

FIG. 7 is flow chart illustrating in general the procedure in thepreferred embodiment for performing an erase operation.

In step 701, the addressed cells are erased, by applying a pulse ofvoltage V_(E) to their erase gates, as disclosed in co-pending U.S.patent application Ser. No. 670,246 filed Mar. 15, 1991 by Harari et al.

After the erase operation, an erase verify operation is performed instep 702. In this erase verify operation 702, a voltage V_(EV) isapplied to the control gates of the cells while the data is read. If, instep 703, all the erased cells are indeed "0" even with V_(EV) as thecontrol gate voltage, the erase operation is considered to be successfuland the operation terminates at step 704. Otherwise the erase operationwill be retried in step 701, unless enough number of retries havealready been performed, as determined in step 705, in which case thememory cells will be considered unsalvageable and a defect managementstep will be performed in step 706. In the defect management step, thefailing memory cells may, for example, be mapped out, and the logicaladdresses of those cells re-mapped into a set of redundant cells. Defectmanagement has been variously disclosed in co-pending U.S. patentapplication, Ser. No. 337,566 of Harari et al., U.S. patent application,Ser. No. 422,949 of Gross et al. and co-pending U.S. patent applicationSer. No. 670,246 of Harari et al., all of which have earlier beenincorporated herein by reference.

FIG. 8 is a flow chart illustrating the procedure in the preferredembodiment for performing a program operation. The program operationapplies to previously erased sectors.

In step 801, the memory cells are programmed. As previously described,for cells which are to be programmed with a "1", a 12 volt pulse isapplied to their word lines and 0 volt is applied to the source line and8 volt is applied to the drain line.

Upon termination of the programming step 801, a program verify operation802 is initiated. In the program verify operation 802, the cells areread with a voltage V_(PV) applied to their control gates. A check isthen performed to see if all the cells are written correctly under suchcontrol gate voltage.

The check can be performed within the memory device 141 (see FIG. 4)using the data stored in the buffer of the write circuit 211. The checkcan also be performed in the memory controller 133 utilizing the ECClogic 416 (see FIG.,5) by having the controller 133 read the data.

If the data are read correctly, as shown in step 803, even with V_(PV)applied at the control gate, a program verify 802 operation isconsidered to have performed successfully. (In the preferred embodiment,V_(PV) is higher then the voltage value for a normal read operation, andtherefore only "1"'s are actually tolerance-tested.) If, on the otherhand, the program verify operation. 802 fails, step 803 will cause theprogram operation to be retried, unless enough retry operations (asdetermined in step 809) have been performed. In that case, the cell isdeemed unprogrammable and defective, and a defection management (step810) similar to step 706 of FIG. 7 will be performed.

In steps 804-807, a read margining operation is performed on the memorycells. The read margining operation are subdivided into twosub-operations 804 and 806. In the first sub-operations, the controlgate voltage is set to V_(PRH) (which, in the preferred embodiment, ishigher than the voltage value for a normal read operation) and the datais read and compared with the actual data (as described above). Thissub-operation is performed to re-verify that all "1"s are programmedproperly. In the second sub-operation, a voltage V_(PRL) , lower thanthe normal read voltage is used. This sub-operation is performed tore-verify that all "0"s are performed correctly. This is done to insurethat no disturbs occurred to previously written bits within a sectorwhile writing subsequent bits within that sector.

In the preferred embodiment, if one of the two sub-operations fails(steps 805 and 807), the defect management operation will be performed.

To further improve reliability of the memory system, a scrub operationis performed in step 808 after a write operation.

FIG. 9 is a flow chart illustrating generally the steps of a scruboperation.

In a scrub operation, generally a different sector of cells sharingcommon bit lines as the sector of cells involved in a program operation,are tested. The sector to be tested may be chosen randomly. In step 901,a control gate voltage V_(SH), which is higher than the voltage for anormal read operation, is applied. The cells are read in step 902 to seeif there is any error as determined by the ECC check. Steps 901 and 902are performed to verify that the "1"s in the chosen sector of cells areunaffected by the program operation.

In step 903, the scrub operation is performed again on that chosensector, with a control gate voltage V_(SL) lower than that for a normalread operation. The cells are read in step 904 to see if there is anyerror as determined by the ECC check. Steps 903 and 904 are performed toverify that the "0"s in the chosen sector of cells are unaffected by theprogram operation.

In the scrub operation, since the actual data does not reside in thebuffer of the write circuit 211 of FIG. 4, the test can be performed inthe memory controller 133 utilizing the ECC logic 416 of FIG. 5. If thedata in the sector(s) to be scrubbed can be read without ECC errors,they are left alone and the program operation is considered to have beencompleted successfully.

If the scrub operation for the sector(s) being scrubbed fails, then arewrite operation 905 will be performed thereon-to reprogram thesector(s). The steps of this program operation follows the procedure setforth in FIG. 8.

The relative levels of the different voltages used for differentoperations in the preferred embodiment are illustrated in FIG. 10.Example voltage values of these levels are listed in FIG. 11.

Additionally, a scrub during read implementation of the presentinvention, analogous to the scrub during write provides still furtherreliability improvement. In this embodiment, a read under margin toeither the selected sector to be read, or another sector, which may bechosen randomly, is performed periodically, for example following every1000 reads as well as at initial power up. If the sector passes thescrub margin reads, no action is taken, while if it fails, the data iscorrected and rewritten. This provides failure look-ahead, prior tofailing read under the normal operation, adding additional reliabilityto the memory.

Data recovery using margining

The ability to provide different control gate voltages in conjunctionwith ECC implementation is advantageously used in the preferredembodiment to further increase the endurance and reliability of thememory system.

If during a normal read operation, an uncorrectable ECC error isdetected, under memory systems heretofore known, the data would beconsidered Unusable. In accordance with the present invention, thecontrol gate voltage can be adjusted either upward or downward in smallsteps about the normal voltage and the data is re-read. If, at aparticular adjusted voltage, the ECC error disappears or becomescorrectable, the corresponding data can then be recovered. The recovereddata is then written back into the cells so that it can be read infuture using normal operating conditions.

While the embodiments of this invention that have been described are thepreferred implementations, those skilled in the art will understand thatvariations thereof may also be possible. Therefore, the invention isentitled to protection within the full scope of the appended claims.

We claim:
 1. A solid-state memory system capable of recovering from read errors, including an array of memory cells, each cell capable of having its threshold voltage programmed to an intended level within a range supported by the memory system, reading means to determine a cell's memory state by comparing the cell's threshold voltage with a read reference level, wherein through use of the memory system, read errors may be caused by the threshold voltage of one or more cells drifted from its intended level, said solid-state memory system comprising:error checking means associated with data read from individual group of cells for identifying read errors therein; means for adjusting the read reference level before each read operation on said individual group of cells containing read errors, each time the read reference level being displaced a predetermined step from a reference level for normal read, until said error checking means no longer indicates read errors; and writing means for re-writing the drifted threshold voltage of each cell associated with a read error to its intended level.
 2. A solid-state memory system capable of recovering from read errors, including an array of memory cells, each cell capable of having its threshold voltage programmed or erased to an intended level within a range supported by the memory system, reading means to determine a cell's memory state by comparing the cell's threshold voltage with a read reference level, wherein through use of the memory system, read errors may be caused by the threshold voltage of one or more cells drifted from its intended level, said solid-state memory system comprising:error checking and correcting means associated with data read from individual group of cells for identifying read errors therein and correcting a predetermined maximum number thereof; means for adjusting the read reference level before each read operation on said individual group of cells containing read errors exceeding said predetermined maximum number, each time the read reference level being displaced a predetermined step from a reference level for normal read, until said error checking and correcting means indicates read errors not exceeding said predetermined maximum number; and writing means for re-writing the drifted threshold voltage of each cell associated with a read error to its intended level.
 3. A solid-state memory system capable of recovering from read errors as claims 1-2, wherein said error checking and correcting means is provided by an error correction code.
 4. In a solid-state memory system including an array of memory cells, each cell capable of having its threshold voltage programmed or erased to an intended level within a range supported by the memory system, wherein hard errors may arise from cells with a threshold voltage drifted sufficiently from its intended level to cause read errors, a method for recovering from said hard errors comprising the steps of:providing an error checking scheme for each of a plurality of groups of cells for identifying read errors therein; adjusting the read reference level before each read operation on said individual group of cells containing read errors, each time the read reference level being displaced a predetermined step from a reference level for normal read, until said error checking means no longer indicates read errors; and re-writing the drifted threshold voltage of each cell associated with a read error to its intended level.
 5. In a solid-state memory system including an array of memory cells, each cell capable of having its threshold voltage programmed or erased to an intended level within a range supported by the memory system, wherein hard errors may arise from cells with a threshold voltage drifted sufficiently from its intended level to cause read errors, a method for recovering from said hard errors comprising the steps of:providing an error checking and correcting scheme for data read from individual group of cells for identifying read errors therein and correcting a predetermined maximum number thereof; adjusting the read reference level before each read operation on a group of cells containing read errors exceeding said predetermined maximum number, each time the read reference level being displaced a predetermined step from a reference level for normal read, until said error checking and correcting means indicates read errors not exceeding said predetermined maximum number, thereby allowing said error checking and correcting means to correct the read errors; and re-writing the drifted threshold voltage of each cell associated with a read error to its intended level.
 6. A method for recovering from said hard errors in a solid-state memory system as in claims 4-5, wherein said error checking and correcting means is provided by an error correction code.
 7. A solid-state memory system, comprising:a plurality of memory cells; a read circuit that determines a memory cell's memory state by comparing the cell's programmed threshold voltage with a read reference level set at a predefined level; a read error detection circuit operated in conjunction with said read circuit that produces a predefined read error indication upon detection of a predefined read error when reading data from individual group of cells; and an error recovery circuit activated each time a predefined read error is detected to shift the read reference level of said reading circuit by a predetermined step from a reference level for normal read, until said error detection circuit no longer produces a predefined read error indication.
 8. A solid-state memory system as in claim 7, wherein said error recovery circuit is deactivated when a predefined condition is reached.
 9. A solid-state memory system as in claim 8, wherein said predefined condition includes when said read circuit read the data without indication of read errors.
 10. A solid-state memory system as in claims 8-9, further comprising a write circuit that re-writes said data to said group of cells after said predefined condition is reached.
 11. A solid-state memory system as in claims 7-8, wherein said error detection circuit includes error correction codes. 